Change Detection Meets Visual Question Answering
نویسندگان
چکیده
The Earth's surface is continually changing, and identifying changes plays an important role in urban planning sustainability. Although change detection techniques have been successfully developed for many years, these are still limited to experts facilitators related fields. In order provide every user with flexible access information help them better understand land-cover changes, we introduce a novel task: detection-based visual question answering (CDVQA) on multi-temporal aerial images. particular, images can be queried obtain high level change-based according content between two input We first build CDVQA dataset including image-question-answer triplets using automatic question-answer generation method. Then, baseline framework devised this work, it contains four parts: feature encoding, fusion, multi-modal answer prediction. addition, also enhancing module aiming at incorporating more change-related information. Finally, effects of different backbones fusion strategies studied the performance task. experimental results useful insights developing models, which future research
منابع مشابه
Machine Translation Evaluation Meets Community Question Answering
We explore the applicability of machine translation evaluation (MTE) methods to a very different problem: answer ranking in community Question Answering. In particular, we adopt a pairwise neural network (NN) architecture, which incorporates MTE features, as well as rich syntactic and semantic embeddings, and which efficiently models complex non-linear interactions. The evaluation results show ...
متن کاملInvestigating Embedded Question Reuse in Question Answering
The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance through reuse of information in the answer to one question to answer another related question. Our analysis shows that a pair of question in a general open domain QA can have embedding relation through their mentions of noun phrase expressions. We present methods f...
متن کاملRevisiting Visual Question Answering Baselines
Visual question answering (VQA) is an interesting learning setting for evaluating the abilities and shortcomings of current systems for image understanding. Many of the recently proposed VQA systems include attention or memory mechanisms designed to support “reasoning”. For multiple-choice VQA, nearly all of these systems train a multi-class classifier on image and question features to predict ...
متن کاملiVQA: Inverse Visual Question Answering
In recent years, visual question answering (VQA) has become topical as a long-term goal to drive computer vision and multi-disciplinary AI research. The premise of VQA’s significance, is that both the image and textual question need to be well understood and mutually grounded in order to infer the correct answer. However, current VQA models perhaps ‘understand’ less than initially hoped, and in...
متن کاملSpeech-Based Visual Question Answering
This paper introduces the task of speech-based visual question answering (VQA), that is, to generate an answer given an image and an associated spoken question. Our work is the first study of speechbased VQA with the intention of providing insights for applications such as speech-based virtual assistants. Two methods are studied: an end to end, deep neural network that directly uses audio wavef...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Geoscience and Remote Sensing
سال: 2022
ISSN: ['0196-2892', '1558-0644']
DOI: https://doi.org/10.1109/tgrs.2022.3203314